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Abstract. Motivated by an evolutionary biology question, we study the following 

problem: we consider the hypercube {0, 1} L where each node carries an independent 

random variable uniformly distributed on [0, 1], except (1, 1, ... , 1) which carries the 

value 1 and (0, 0, ... , 0) which carries the value x G [0, 1]. We study the number 9 of 

£/■) paths from the root (0, 0, . . . , 0) to the opposite corner (1, 1, . . . , 1) along which the 

r- H values on the nodes form an increasing sequence. We show that if the value on the 

root is set to x = X/L then 0/L converges in law as L — > oo to e~ x times the product 

On of two standard independent exponential variables. 

5^ As a first step in the analysis we study the same question when the graph is that 

of a tree where the root has arity L, each node at level 1 has arity L — 1, . . . , and the 

nodes at level L — 1 have only one offspring which are the leaves of the tree (all the 

leaves are assigned the value 1, the root the value x £ [0, 1]). 

CO 

P^ 1. Introduction 

We consider the following very simplified model for the evolution of an organism. The 
genetic information of the organism is encoded into its genome which, for our purposes 
is a chain of Lq sites. With time, the organism accumulates mutations. 

If we suppose that there are only two possible alleles on each site, it makes sense when 
looking at the genome to only record whether the allele carried at a given site is the 
original one or the mutant. We will represent a genetic type by a sequence of 0's and l's 
of length Lq where we put a at position i if this site carries the original code or a 1 if 
it carries the mutant. Hence, a genetic type is a point a G {0, 1} L °, the Lo-dimensional 
(S) hypercube. 

As an organism evolves by successive mutations, its genetic type travels along the 
■^- edges of the hypercube. We suppose that each genetic type a G {0, 1} L ° is characterized 

by a certain fitness value x a . We assume that the population is in a regime with a low 
mutation rate and strong selection; this means that when a new genetic type (mutant) 
appears in a resident population one of two things can happen: either the mutant type 
. £h has better fitness and it fixates (i.e. it invades the whole population and becomes the 

resident type), or its fitness is lower and it becomes extinct (i.e. no one in the population 
carries this type after some time). Therefore, in that low mutation and strong selection 
regime, the only possible evolutionary paths are such that the fitness is always increasing. 
We say that such paths are open. 

Somewhere in the Lo-dimensional hypercube, there is a type with the highest fitness. 
We call L the distance between that type and the original one; i.e. L is the number of 
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mutated alleles in the fittest type. A natural question is whether there is an open path 
from the original type to the fittest. Such a path has at least L steps but may contain 
many more. Because of the low mutation rate, evolution takes time, and we interest 
ourselves here only in the shortest open paths leading to the fittest type, that is in the 
paths with exactly L steps, for which mutation never goes back: a site can only change 
from the original type to the mutant one. 

In that setting, it is sufficient to consider the L-dimensional hypercube which contains 
(as opposing nodes) the original type, noted gq = (0, 0, . . . , 0) and the fittest type ctl = 
(1,1,1,. ..,1). We consider paths through that hypercube along the edges which always 
move further away from the origin (i.e. at each step a is changed into a 1 in the sequence) 
and, out of the L! possible paths, we wish to count the number 9 of open paths, that is 
the number of paths such that the fitness values form an increasing sequence. In biology, 
paths with increasing fitness values are also referred to as selectively accessible (see [8] , 

EMU)- 

To count the number of open paths, we need to know the fitness values of all the 
nodes. As a zero-model, we consider the case where the fitness values are random 
variables chosen independently with a common distribution, the so called "House of 
Cards" model (jl], [3]). As we are only interested in whether a sequence is increasing or 
not, the results will not depend on the specific distribution (as long as it is atomless). 
We therefore choose to give a fitness 1 to the fittest node and to assign uniform random 
numbers between and 1 to each other nodes. 

To summarize, motivated by this simple model, the problem we consider is the fol- 
lowing: given an L-dimensional hypercube with L large, how many oriented paths are 
there from oo = (0, 0, ... ,0) to ox = (1, 1, . . . , 1): 

O~0 ->■ 01 ->■ 0-2 ->■ • • • ->■ &L, 

where each ctj+i is obtained from Uj by changing a single into a 1 in the sequence <Tj, 
such that the fitness values x a form an increasing sequences: 

•X-cj-q \ J>ai \ ' " " \ X<j^ •) 

where x aL = 1 and where all the other x a are assigned random independent fitness values 
uniformly chosen in [0, 1]? A variant to this model which we also consider is when the 
starting point is assigned a non-random fitness value x ao = x with x given. 

The correlation structure of the hypercube raises significant technical challenges. As 
a first step, we study the following simplified problem: instead of working on the L- 
dimensional hypercube we chose to work on a deterministic rooted tree as in Figure [T] 
with arity decreasing from L to 1: the root is connected to L first level nodes, each first 
level node is connected to L — 1 second level nodes, etc. There are L levels in the tree 
and L! directed paths. The number of possible steps at level k is then L — k, as on the 
hypercube. Each of the L! leaves of the tree (at level L) are assigned the value 1. All 
the other nodes are assigned independent random numbers uniformly drawn between 
and 1, except perhaps the root to which we may choose to give a fixed value x. We are 
interested in the number 9 of directed paths on the tree going from the root to one of 
the leaves where the numbers assigned to the visited nodes form an increasing sequence. 
As before, such a path is said to be open. 
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Figure 1. A tree for L = 4 and x = 0. The bold red lines indicate the 
directed paths going down from the root (at the top) and which visit an 
increasing sequence of numbers. There are 9 = 2 paths going all the way 
down to the leaves of the tree. 

We mention that other models of paths with increasing fitness values on trees have 
also been considered in the literature [BJ. 



2. Main results 

Throughout the paper we use the following notations for the probability of an event 
and for the expectation and the variance of a number: 

P x (-), E x (-), Var x (-) when the root has value x 

¥*(■)= [ dxP x '(-), E*(-)= / da: !*(•), Var*(-) when the root is uniform in [0, 1] 
Jo Jo 

Note that the size L of the system is implicit in the notation. Whether we work on the 

tree or on the hypercube will always be made clear from the context. We call 9 the 



number of open paths. The notation a^ ~ hi, L — > oo means lim^^oo 






1. 



Obtaining the expectation of the number of paths for a given x is easy: there are 
L\ paths in the tree or the hypercube. Each path has probability (1 — x) that 
the L — 1 intermediate numbers between the root and the leaf are between x and 1. 
Furthermore, there is probability 1/(L — 1)! that these intermediate numbers form an 
increasing sequence. Hence, the probability that a given path is open is (1— x) j{L— 1)! 
and thus 

E x {6) = L{l-x) L - 1 (2.1) 

both for the tree and the hypercube. 

Thus, if x = there are on average L open paths, and if x > the number of open 
paths goes in probability to zero when L becomes large. 

The most biologically relevant variant of the model is when x is also randomly picked 
as the other nodes. The expectation of 9 (both on the tree and on the hypercube) is 
trivially obtained by integrating (2.1): 



E*(0) = 1, 



(2.2) 



but the typical number of paths for L large is not of order 1 , as can be seen by looking 
on the variance of 9: 

Var*(0) 



lim 

L— >oo 



L 



1 



on the tree. 



(2.3) 



4 JULIEN BERESTYCKI, ERIC BRUNET, AND ZHAN SHI 

(All the variance computations on the tree are carried out in sectionpl) In fact, this can 
be understood by considering starting values x scaling with the size L of the system as 
x = X/L with X fixed: 

Proposition 1. In the case of the tree, 

lim Et($/L) = e~ x , lim V&A(9/L) = e~ 2X . (2.4) 

In the case of the hypercube (Hegarty-Martinsson [2] ) 

lim E*(0/L) = e~ x , lim Var$ (<9/L) = 3e' 2X . (2.5) 

(Note that Hegarty-Martinsson consider a different scaling regime, but their proof can 
be adapted without any modification to the result above.) 

Since the variance in Proposition [T] scales like the square of the expectation, it means 
that when the starting value is 0(1/ L) there are O(L) open paths in the system. 
When the starting value is chosen randomly, there is a probability 0(1/ L) that it is 
in fact 0(1/ L) yielding 0(L) open paths. On average we thus expect from these events 



alone 0(1) open paths, with a variance O(L), as in (2.3). 
This observation can be made more precise: 

Theorem 1. On the tree, for a starting value x = X/L, the variable 9/L converges in 
law when L — > oo to e~ x multiplied by a standard exponential variable. 

Theorem 2. On the hypercube, for a starting value x = X/L, the variable 0/L converges 
in law when L — >• oo to e~ x multiplied by the product of two independent standard 
exponential variables. 

It will become apparent in the proofs that we get a product of two independent vari- 
ables on the hypercube because, locally near both corners (0, 0, ... , 0) and (1, 1, . . . , 1), 
the hypercube graph looks roughly like the tree. 

We conclude with the following remark: when the starting value x is picked randomly, 
even if the expectation and the variance of 9 is dominated by values of x = 0(1/ L), 
the probability that there exists at least one open path is dominated by starting values 
x = In L/L + 0(1/ L). This was made clear on the hypercube in [2j and we here state 
the tree counterpart. 

Theorem 3. On the tree, when the starting value is x = (InL + X)/L 

InL+X , v InL+X , . r, v v 

lim E~ ^(6») = e~ x , lim Var~ ^(6») = e~ 2X + e~ x . (2.6) 

L— >oo L— >co 

When the starting value x is chosen at random uniformly in [0, 1], the probability to have 
no open path goes to 1 as L — ^ oo and 

P*(#>1)~^ asL^oo. (2.7) 

L 

The rest of the paper is organized as follows: we start by proving Theorem [3] in 
Section [3] as it is the simplest, and then we prove Theorem [T] in Section [4j The proofs rely 
on Proposition[T]which is itself proven in Section|5]for the tree. In Section[6j we introduce 
the notion of Poisson cascade, which allows to give a probabilistic interpretation of one 
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of the main object introduced in our proofs. Finally, Theorem p] (on the hypercube) is 
proven in Section [7j 

3. Proof of Theorem [3] 



In (2.6), the result on the expectation is trivial from (2.1), and the result on the 
variance is obtained in Sectional In this section, we prove (2.7). 

Let us start with the upper bound. Markov's inequality with (2.1 ) leads to 

F x (9> 1) <inin[l,L(l-a;) L_1 ]. (3.1) 

> 1) = Jo F x (9 > 1) dx at xo = 1 - exp[-(ln L)/(L - 1)] since 
1. We end up with 



We split the integral P 
that is the point such that L(l — xq 

InL 



.2,-1 



> 1 < 1 



exp 



L-l 



+ exp 



L 



L-l 



InL 



InL 



L 



+ 



°(i) 



(3.2) 



We now turn to the lower bound. Let L t-t f(L) be a function diverging more slowly 
than InL: 

lim/(L)=oo, 0</(L)<lnL, 1; 



L— >oo 

It is sufficient to show that 



lim 
1,-kx) In L 



0. 



lnL-f(L) 

lim P L | 



>D 



1. 



because 



>D> 



In £,-/(£) 



InL— f(L) In L-/(L) , 

> 1) dx > ^A^p — z — (Q > l) 

Li 



(3.3) 
(3.4) 

(3.5) 



where we used that x \— >¥ x (9 > 1) is a non-increasing function. Taking L large in (3.5) 
assuming (3.4) gives the lower bound. 



It now remains to show ( |3.4[ ). We consider a tree started from x = [InL — f(L)]/L, 
and call m the number of nodes at first level with a value between x and (In L)/L. 
This number m is a binomial of parameters L and f{L)/L. Conditionally on m, the 
probability to have no open path in the tree is smaller than the probability to have no 
open path through these m specific nodes. Thus by summing over all possible values of 
m we get: 

, 0) <j:( L ](mT(i-mY' m 



lnL-f(L) 



m=0 



,m) \ L J 



L J 



InL . 



L-l 



(0 = 0) 



(3.6) 



where we used ¥ x (9 = 0) < p( lnL )/ L (# = 0) for x < (In L)/L. Note the obvious extension 
to the notation to mark that the probability on the right hand side is for a tree of size L—l 
and not L as on the left hand side. Summing (3.6), one gets 

f(L) 



In L-f(L) 



0)< 



iL 



1 



L 



1 



;L-1 



(9 = 0) 



(3.7) 



But from Cauchy-Schwarz (applied to 9 and 1(9 > 1)) and (2.6), which is proved in 
Section 15.31 one has 



InL 

'TTl 



>1)> 



E^(g) 2 
E^(02) 



1 
1 — > ^ 



(3.8) 
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so that, for L large enough 

In L.t 

and thus, for L large enough 



L ~ 1 (0 > 1) > F J ^=T 1 '' L ~ 1 (9 > 1) > 0.33, 



InL-f(L) 



o)< 



1 



f(L), 



L 



Hi. Li , T 1 , 

— ;L - 1 re > i 



< 



1 - 0.33 



f(L) 
L 



(3.9) 
(3.10) 



which goes to zero as L — > oo, as required. 



4. Proof of Theorem [Q 

In this section, we consider the case of the tree with a starting value x which scales 
as x = X/L, X > being a fixed number. The natural starting point for a proof would 
be to introduce G, the generating function of 9: 

-A6^ 



G(\,x,L) =E x (e- xe ) 



(4.1) 



with parameter A > 0, for which it is very easy to show from the tree geometry (each of 
the L nodes at first level are the root of an independent tree of size L — 1) that: 

L 



G(A,x,l) = e 



G(X,x,L) 



+ / dyG(\,y,L-l) 

J X 



for L > 1. 



(4.2) 



However, extracting the limiting distribution directly from (4.2) seems difficult because 



the number of levels and the size of each level increase together and because the fixed 
point equation does not give the A dependence of the result. We shall rather use an 
idea which proved to be very generic and powerful in branching processes: the value of a 
random variable is decided during the early stages of a branching process; at later stages 
the law of large numbers kicks in (see for instance |5j). 

Assume that all the information at the first k levels of the tree is known, and call 9 k 
the expected number of paths given that information: 

e k = E{6\F k ), (4.3) 

where J-/, is the available information up to level k. For instance, consider the tree of 
Figure HI up to level k = 2. There are three paths still open with end values (at level 2) 
given by 0.22, 0.66 and 0.95. Therefore from (|2~Tj) , 6 2 = 2(1 -0.22) + 2(1 -0.66) +2(1 - 
0.95) = 2.34. Similarly, 6>i = 3(l-0.59) 2 + 3(l-0.90) 2 + 3(l-0.01) 2 + 3(l-0.83) 2 = 3.56 
and #3 = 64 = 6 = 2. A general expression of 0t for k < L is 

9k =T, l| ff o P cn}(£ ~k)(l- x a ) L ~ k -\ (4.4) 



E 

\a\=k 



where we sum over all nodes a at level \cr\ = k in the tree, x a is the value of the node 
a and the event {a open} is the J-"^! -measurable event that the path from the root to 
node a is open. 

Heuristically, when k is small, there are few paths open up to level k and the variance 
of 9 given Tu is large: 9 has no reason to be close to 9 k . When k is large, however, 
there are many paths open up to level k which all contribute to the value of 9. The 
law of large numbers leads (on the good scale) to a small variance of 9 given J- k and 
9k becomes a good approximation of 9. The advantage of this approach is that one can 
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take the L — > oo limit for a fixed k (keeping the depth of the tree constant) and then 
take the k — > oo limit. 

Our proof consists then in two steps: 

• first we show that 



lim P§ I - < ; 

L— >oo V L 



lim lim Fl I — < z 
k— >oo L— >oo V L 



9k 



(4.5) 



which means that 9/L for a starting point X/L and L large has the same distri- 
bution as 9k/ L with the same starting point for large L and then for large k; 
• then we make use of a generating function similar to (4.1) to show that the 



distribution of 9k/ L after taking the limits is given by an exponential law. 



4.1. Proof of (4.5). Pick S > 0. Observe that 

(9 k 



L 



< z 



L 



< z 



T k ) <l(f<z + 5 
i 7 /,) >t(j-<z-5 



.F fc )+P( 

T k 



L 



>S 



9 k \ 



L 



>5 






(4.6) 



The first inequality follows from the simple remark that it is obviously true when 9k/ L < 
z + 5 and that, when 9k/ L > z + 5, it is necessary to have (9k — 9)/L > 5 to get 9/L < z. 
Similarly, the lower bound in the next line is trivial when 9k/ L > z — 5 and it is sufficient 
to have (9 — 9k) /L < 5 when 9k/ ' L < z — 5. 

As 9k is the expectation of 9 given J 7 ^, one has from Chebyshev's inequality: 



9 k \ 



L 



>5 



Fk\ < 



Var(g|J- fc 
L 2 5 2 



We use (4.7) into (4.6) and then take the expectation (over J~k)'- 

^(®k 



X /t/u 



(°i^-t) 



ET\Vav(9\^ k )} x ( 9 \ 



Therefore to show (4.5), it is sufficient to have 



lim limsup-^E$[Var(6»|7' fc )] = 0, 

/c->oo L-J.OO Li 



(4.7) 



Er[Var(g|^ fc )] 
L 2 5 2 



(4.8) 



as well as the existence and continuity of the right hand-side limit of (4.5). 



In Section 5.5 we will show by direct analysis of the second moment that 



1 



-2X 



lim -^Eir[Var(0|Jfc)] 



(4.9) 



which yields (4.8). We will now compute the distribution of 9k/ L in the double limit 
L — > oo and k — > oo and, as the result will be a continuous function of z, this will 
complete the proof. 
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4.2. Distribution of 9 k - Similarly to (4.1), we define Gk(X,x,L) the generating func- 
tion of 6k for a tree of size L and a value x at the root: 

G k (X,x,L)=E 

x) L_1 one has 

\L-1 



* (e- xe *) 



As 9 = E x (9) = L(l 



Gq(X, x, L) = exp — AL(1 



(4.10) 
(4.11) 



and the recursion relation 



G k (X,x,L) 



x 



+ f dy G fc _i(A, y, I - 1) = 1 - / dy (1 - G fc _!(A, y, L - 1)) 

Jx Jx 



(4.12) 
to be compared to (4.2). 

This relation is obtained by decomposing on what happen at the first splitting. For 
a node a connected to the root let #fc(er) be the conditional expectation given J- k of the 
number of open paths going through a. The {9k(&)}\cr\=i is a collection of L independent 
J-fc-measurable independent variables, hence: 

G k (\,x,L) = [E*(e- xe ^)] L , (4.13) 

where a is a given node in the first generation. 

Let us evaluate E x (e~ A6,fe ( CT )). If x a < x then 9 k (a) = and since this event has 
probability x it contribute xe~ xo = x to the expectation. With a probability dy for 
y £ [x, 1] the value at the node is y > x and some paths might go through that node. 
The subtree rooted at a is like the initial tree but of dimension L — 1 and we want to 
evaluate the average number of paths in that subtree given the information after k — 1 
steps, hence the term in the integral of (4.12). 



The strategy is to take the L — > oo limit at fixed k in (4.11 ) and (4.12 ) after a proper 
rescaling, then to let k — >■ oo. We only consider A > 0; it is sufficient to characterize the 
distribution, and it simplifies the arguments below. 



Step 1. We first show that the following limit exists (for \i > 0): 

H X 



Vo, b, 
and that the limit satisfies 



G k 



L + a L + b' 



L 



->G k (n,X), 



(4.14) 



G k {n,X) =exp 



x 



[i-G fc _i(/x,y)]dy 



Gq(h,X) = exp [ - /xe~ x ] . (4.15) 



From (4.11), it is obvious that (4.14) holds for k = with the limit given in ( 4.15| ). 
Choosing k > 0, we assume that (4.14) holds for Gk-x- Then, after a change of variable 
in d4~T2|), 



G k 



/' 



X 



L+a L+b 



L 



1 



1 



L + bJx 



L+b 



dY 1 



Gfc_if 



/i 



Y 



L+a L+b 



,-L-l 



The Gjt-i on the right hand side has a L — >■ oo limit. From its definition (4.10), one has 
1 > G k {X,x,L) > 1 - XE x {9 k ) = 1 - XE X {9) = 1 - AL(1 - x) L ~ x . (4.16) 
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Then, assuming /i > 0, for all a and b, one has for L large enough (depending on a and 

by. 



l>G k . 



, L - 1 > 1 - 2ne 



-Y/2 



L + a L + b 1 
Thus, from the dominated convergence theorem, we have 



(4.17) 



L+b 



&Y 1 



x 



( I » Y T 



1 



> 

L—>oo 



X 



dy(i-G fc _i(/i,y)), (4.18) 



and thus (4.14) holds for G k with the relation (4.15). 



Step 2. The fact that (4.14) holds means that when starting with x = X/L, the random 
variable 0/./ L has a well defined limit as L goes to infinity, and that the generating 
function of that limit is G k . We now use the recurrence (4.15) to take the k — > oo limit 
which will show that liniL_-. 00 9k/ L converges (when k — > oo) to an exponential variable. 
This task is greatly simplified by noticing (by a simple recurrence) that one can write 
Gk as a function of one variable only: 

G k (fi,X) = F k ^e- x ) 

with 

r l - Fk-i (z') 

Jo z> 



-dz 



F k {z) = exp 
We shall show that the solution to (4.20) satisfies 

F k {z) 



Fo{z) 



(4.19) 
(4.20) 



1 



1 + z 



for z > — 1, 



(4.21) 



which implies that lim^^oo 9 k /L converges weakly when k — > oo to an exponential 
distribution of expectation e~ . Note that we only need to consider z > and, in 
fact, we proved (4.20) only for z > 0, but (4.21) holds for the solution to (4.20) for 
z G ( — 1, oo). 



Defining 5 k (z) for z > — 1 and z ^ by 



F k {z) 



1 



Sk(z) 



(4.22) 



l + z (1 + z) 3 2 k 
it is easy to see that there exists a constant M such that for all k and all z > — 1 

< 5 fe (z) < M. (4.23) 

Indeed, for fc = 0, 

(l + z) 3 



S (z) 



1 



(4.24) 



z' \l + z 

5o(z) > for z > — 1 because e z > 1 + z by convexity. Furthermore, <5o(z) can be defined 
by continuity at z = 0, has a limit in z = +oo and in z = — 1 and reaches therefore a 
maximum M on (— l,oo), which initializes (4.23). 



Assuming now (4.23) at order k — 1, one has 

i r r* z' «*_!(*') 



*Mz) 



i + z 



exp 



1 + * 



A3 



)fe-l 



dz' 



(4.25) 
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leading to 
1 



, L >F k (z)>—- 
1 + Z 1 + z 



M 



[l + z 



A3 



dz' 



1 

1 + z 



M z 2 

W-l 2(1 + 



which gives ( |4.23 ) at order k. Hence the limit (4.21 ) holds. This completes the proof of 
Theorem [U 



5. Results on the second moment for the tree 



The goal of this section is to prove the second moment results (2.3), (2.4), (2.6) and 
(4.9) which were used in the proofs of Theorems IT] and [3] 



5.1. Exact expression of the second moment. The expectation of 9 2 is the sum, 
over all the pairs of paths, of the probability that both paths are open. There are L\ 2 
pairs of paths in the system. For a given pair, the probability that they are both open 
depends on the number q 6 {0, 1, 2, . . . , L — 2, L} of bonds shared by the paths. (Nota: 
two paths may not have exactly L — l bonds in common.) The following facts are clear: 

• the number of pairs of paths which coincide all the way (q = L) is LI; 

• the probability that "both" paths in such a pair are open is (1 — x) j{L — 1)!; 

• the number of pairs of paths which coincide for q = 0, 1, . . . L — 2 steps and then 
branch is L\(L-q-l)(L-q-l)\; (Remark: 1 l!+2 2!+3 3! + - • - + (L-1)(L-1)! = 
LI — 1, hence one recovers that the total number of pairs of path is LI 2 .) 

• the probability that both paths in such a pair are open is 



(! _ x )2L- 9 -2 / 2 L-2g-2 N 
(2L- g -2)! I L-q-1 , 



(5.1) 



Indeed, excluding the starting and end points, there are 2L — q — 2 total different 
nodes in such a pair of paths. All these nodes must be larger than x, hence the 



(1 



\2L q 2 t erm _ This is however not sufficient because the values on the 



nodes must be correctly ordered. Out of the (2L — q — 2)! possible ordering (see 
the denominator), the only good ones are those such that the q smallest terms 
are well ordered in the shared segment (only one choice), and the 2L — 2q — 2 
remaining terms are separated into two well ordered blocks of L — q — 1 terms, 
one for each path; the only freedom is to choose which terms go to which path, 
hence the binomial coefficient. 



This leads to 



where 



L-2 



E 



Xfn2\ 



J2a(L,q)(l-x) 2L -o- 2 + L(l-x) 

g=0 



L-l 



a(L,q) 



L\{2L-2q-2)\ 
{L-q-2)\{2L-q-2)V 



(5.2) 



(5.3) 



The isolated term in (5.2) corresponds to the pairs of identical paths and is equal to 

W{9). 
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5.2. Estimates and bounds on the a(L,q). Expanding the factorials in a(L,q), one 
gets 



a(L,q) 



29 (l " *) (l 



<?+3 
2L 



)-(l"¥) 



(5.4) 



(5.5) 



From this expression, one gets the following equivalent when L — > oo and g <C v L: 

a(L,g) = ^[l + 0(g 2 /L)]. 

For g close to L, one has the values a(L, L — 2) = 2, a(L, L — 3) = 24/ X, a(L, L — 4) = 
360/[(L + l)(L + 2)], etc. 

We want to find a good upper bound for a(L, q). We first show that q i— >■ hi a(L, g) is 
a convex function for L > g + 3. Indeed 

(L-q-l)(2L- 



In a(L, 



In a(L, g — 1) 



In 



1 



(2L - 2q)(2L - 2q - 1) 



(5.6) 



so that 



[In a(L, q) - In a(L, q-1)]- [In a(X, g - 1) - In a(L, q - 2)] 

(L-q- 1)(2L - g - 1)(2L - 2g+ 2)(2L - 2g+ 1) 



ln (2L-2q)(2L-2q-l)(L-q)(2L-q) ' ^ 

Since the denominator is clearly positive as soon as L > q + 1 we see that lna(L, q) is 
convex if in (5.7) the numerator is bigger than the denominator. This condition leads to 

(L - g) 2 (2L - 1) - (2L - q - 1)(2L - 2g + 1) > 0, 



which holds as soon as L — q > 3. 

Assume L > 12 so that a(L, L — 3) < 2 and let 

~ln(L 2 ) 



qo(L) 



ln2 



+ 1 



From (5.4) one has 



a(L, i 



< 



1/ 1 



(5.8) 



(5.9) 



(5.10) 



Applying this to q = qo(L), one easily gets a(L, qo(L)) < 1 for L large enough. (The 
term L 2 /2 qo is smaller than 1/2, and the parenthesis converges to 1.) 
By using the convexity of lna(L, g), one has 



ln a(L, 



< 



lna(L,0) + 
hi 2 



qo(L) 



lna(L,g o (0) -lna(L,0) for 0< q<q (L), 

for q (L) <q<L-2. 



But lna(L, go(-L)) < so that 

a(L, 0) exp 



a(L, 



< 



lna(L,0) 
90 (L) 



for0<g<g (i), 
for q (L) <q<L-2. 



(5.11) 
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Remark now that a(L, 0) = L{L — 1) < L 2 and In a(L, 0)/qo(L) — > In 2 as L 
implies that, for L large enough, hxa(L,0)/qo(L) > In 1.99 and that 

(T .\L 2 l.m-* for < q < q (L), 
a(L,q) < < 

12 for q (L) <q<L-2. 



oo. This 



(5.12) 



5.3. Proofs of the limits (j^Sj), §2A§ and §2J$ of Var(0). The second moment fl5T2[ ) 
is written as a sum from q = to q = L — 2. To prove the various limits we need, the 
strategy is always the same: 

(1) Split the sum over q into two parts; one going from to qo{L) and one going 
from qo(L) + 1 to L — 2. 

(2) In the first sum, replace a(L,q) by its equivalent (5.5); this is justified with the 
dominated convergence theorem, using the bound (5.12). 



(3) Show that the second sum do not contribute using the bound (5.12) 
Ii 

Var*(#) E*{6 2 )-E*{9y 



Proof of (2.3). Integrating (5.2) over x and using E*(#) = 1, one gets 

qo(L) 



L 



^ a(L, q )/L 

q=0 



L-2 



1 



Y- a(L,q)/L 2 ^ 

Z^ o g+1 "•" Z^ 



a(L,q)/L 



g =0 



2+1 

L 



2L 



1 



q=qo(L)+l 

In the first sum, the running term is equivalent to 2~ q j2 when L is large and is dominated 
by 1.99™ 9 for L large enough. Therefore, this first sum converges to 1. In the second 
sum, the running term is smaller than 2/L 2 , implying that the whole second sum is 
smaller than 2/L and thus vanishes in the large L limit. 

Proof of (2.4). We divide (5.2) by L 2 , replace x by X/L and split the sum: 



X 

Em 



L 2 



qo(L) 

E 

q=0 



a(L,c 
L 2 



■(-§) 



X^2L-q-2 

T 



L-2 

+ E 

q=qo(L)+l 



a(L,q) , X^L- q -2 (1 

l 2 y l) + 



L) 



L 



(5.13) 

The running term in the first sum is equivalent to 2 q e and is dominated by 1.99 q , 
therefore the first term converges to 2e - as L — > oo. The running term in the second 
sum is smaller than 2/L 2 implying that the whole second sum is smaller than 2/L and 
thus vanishes in the large L limit. The isolated term goes also to zero. Therefore, 
the whole expression converges to 2e~ 2 ^ and one recovers the variance in (2.4) after 
subtracting E x / L (6/L) 2 . 



Proof of ( |2.6[ ). We now take x = (In L + X)/L and split again the sum in (5.2 ) into two 
parts: 



E x {9 2 ) 
Using 



qo(L) 

E 

9=0 



a(L,q) 
L 2 



L-2 



xL 2 (l-x) 2L - q - 2 + J2 a(L,q)(l-x) 2L - q - 2 +L(l-x) 
q=qo(L)+l 



L-\ 



lim L 2 ( 

L— >oo V 



1 



lnL + X\2L-g-2 



-2X 



(5.14) 



(5.15) 



L— >oo V L 

into (5.14), the running term in the first sum is equivalent to 2 _9 e~ 2 ^ and is dominated 
by 1.99-"(e- 2X + l) for L large enough (because L 2 (l-x) 2L ~ q - 2 < L 2 (l-x) 2L - qa( - L ^ 2 , 
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which becomes close to its limit when L gets large). Therefore, the first sum converges 
to 2e~ 2X . We write an upper bound of the second sum of (5.14) using a(L,q) < 2 and 
then extending the sum to the interval [0, L — 1]: 



q=qo(L)+l 



x) 2L- q -2 < 2(1 _ X )2L- 2 (1 g) L 1 < 2 (1 ^ * 



-A' 



(1-x)- 1 -1 



InL 



(5.16) 

which goes to zero for L large. Finally, the last term in (5.14) converges to e~ x ; putting 
things together, one finds K^ nL+x >' L (9 2 ) — > 2e~ 2X + e~ x . Removing the expectation 
squared, one recovers (2.6). 

5.4. Exact expression for E x '[Var(0|J r / ! c )]. The number 6 of paths given J-^ is the sum 
over all the nodes at level k of the number of paths through that node. These variables 
are independent; therefore 



Var(f9|J" fc ) = E 1 W o P en} v ( x a, L-k) 

\a\=k 

where v(x, L) is the variance of 8 for a tree of size L started at x 



(5.17) 



L-2 



v(x,L) :=E X {9 2 )-E X {6) 



-L(l - x) 2L - 2 + E < L , ?)(1 " x) 2L ~ q - 2 + L(l 



9=1 



Taking the expectation over J 7 ^, one gets 



dx CT 



x) 



fe-i 



(fc-1)! 



-v(x a ,L- k), 



\L-1 



(5.18) 



(5.19) 



where L\/{L — k)\ is the number of terms in the sum and where the fraction in the 
integral is the probability that a is open given the value of x a > x. 

Performing the integration term by term is easy, one finds, after simplification: 



E^Var^Jfc)] 



a(L, k) 
L-k-1 



(l_ x )^-fc- 2+ J- a{L,q){l-xf L -i- 2 + L{l-x) L -\ 

q=k+l 



(5.20) 

Note that apart from the first term, this is exactly the same as the full variance v(x, L) 
except that the sum over q begins at k + 1 instead of at 1 . 



5.5. Proof of (4.9). We now divide (5.20) by L , set x = X/L and consider L large. 
We only need an upper bound, but it is as easy to calculate the exact limit. As in 
Section 5.3 we split the sum into two parts; one where the index q runs from k to qo(L) 
and one from qo(L) + 1 to L — 2. In the first part, using the dominated convergence 

(5.21) 



theorem with the bound (5.12): 
qo(L) 

L—>oo 



lim > 



q=k+l 



a(L, c 

~1X 



, X\2L-q-2 



q=k+l 



1 



-2X 



1 



-2X 



2i 
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Also using the bound (5.12), the second part of the sum goes to zero: 



-i Li — Z y or ~ o -i 

— 2 Y. < L ^)(l-^) ' < L2 xLx2. (5.22) 

q=q (L) + l 



It is very easy to check that in (5.20) the two isolated terms (divided by L , of course) 



go also to zero, so that one finally obtains (4.9). 



6. A RELATION WITH POISSON CASCADES 

Our model is closely related to cascades of Poisson processes. In fact, the arguments 
we used in Section [3] can be presented in terms of Poisson cascades. Let us make a brief 
description. 



We recall the sequence of functions F^, k > 0, defined in (4.20): 

1 - ^fc-i(^) 



F k (z) = exp 



dz' 

o z' 



F (z) = e- z . (6.1) 



It is clear that Fq{z) is the Laplace transform of the Dirac mass at 1, and that F\ is 
the Laplace transform of Y^h=x Xj, where (Xj, j > 1) is a Poisson process on (0, 1] with 
intensity l (0jl] (x)^. 

We now define a cascade of Poisson processes. At generation k = 0, there is only 
one particle at position 1. At generation k = 1, this particle is replaced by the atoms 
(X- , j > 1) of a Poisson process on (0, 1] with intensity l( 0i ii(x) -^-. At generation 

k = 2, for each j, the particle at position X- is replaced by {X- X- e , £ > 1), where 

(2) A 

{X-^t > 1) is another Poisson process with intensity 1( i](^) (all the Poisson 
processes are assumed to be independent). Iterating the procedure results in a cascade of 
Poisson processes. We readily check, by induction in k, that F k is the Laplace transform 
of Yk, the sum of the positions at the k-th generation of the Poisson cascade. 

What was proved in Section [3] can be stated in terms of the cascade of Poisson pro- 
cesses. Recall from (4.3) that 9k = K(8\Fk)- 



Theorem 4. (i) For any k > and for x = X/L, -j^ converges weakly, when L — V oo, 

to e~ x Y k . 

(ii) When k —> oo, Y^ converges weakly to the standard exponential law. 

7. Proof of Theorem [2] 

In this section, we adapt the methods used in Section [4] to obtain the distribution of 
the number of open paths on the hypercube when L goes to infinity. 

In the large L limit, both the width (the number of possible moves at each step) and 
the depth (the number of steps) on the hypercube go to infinity, which makes studying 
the limit difficult. We worked around that problem on the tree by introducing 9^, the 
expected number of paths given the information Fk after k steps, and by sending first 
L (now representing only the width of the tree) and then k (the depth) to infinity. 

We use the same trick on the hypercube, but with a twist: the hypercube is symmet- 
rical when exchanging the starting and end points, and there is no reason to privilege 
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one or the other. Therefore, we call 9k the expected number of paths in the hypercube 
given the information J- k at the first k levels from both extremities of the hypercube. 



To write an expression for 9 k similar to (4.4), we introduce the following notations: 

• The ( k ) nodes k steps away from the starting point are indexed by a and, as 
usual, their values are written x a . 

• Similarly, r indexes the ( fc ) nodes k steps away from the end point and we note 
their values 1 — y T . 

• riff _ {0, 1, . . . , k\} is the number of open paths from the starting point to node a. 
(Contrary to the tree, there are several paths leading to each node a.) 

• Similarly, m T is the number of open paths from node r to the end point. 

• l(cr <R- t) indicates whether there is at least one directed path (open or not) from 
node a to node r. 

Then, 

°k= E E n a m T t(a^T)(L-2k)(l-y T -x a ) L - 2k - l t(x a +y T <l), (7.1) 

\cr\=k \r\=L-k 

where 1(<7 f-> r)(L — 2k)(l — y T — x a ) L ~ 2k ~ 1 l(x (T + y T < 1) is the expected number of 
open paths from a to r given the values x a and y T . 
Our proof can be decomposed into three steps: 

• First, we show that, as in the hypercube, the distribution of 9/L as L — > oo is 
the same as the distribution of 9k/ L as L — > oo and then k — > oo. 



Then, we show that the double sum in (7.1) can be modified (without changing 



the limit, of course) into a product of two sums. This means that asymptotically 
9k can be written as a contribution from the k first levels (the sum on a) times 
an independent contribution from the k last levels (the sum on r). 
• Finally, we show that each of these two contributions is asymptotically identical 
in distribution to what we computed on the tree. 

7.1. First step: 9k and 9 have asymptotically the same distribution. We show 
in this section that, when the starting point scales with L as x = X/L for X fixed, 

lim % wcaMy > lirn %. (7.2) 

L— >-oo L fc— >oo L— >oo L 



Following the same argument as on the tree, it is sufficient to show that 

lim lin 



lim lim sup —,Et [ Var(6>| F k )] = 0. (7.3) 



First remark that 

E x [Vav(9\T k )] = E x [9 2 } - E x [9 2 k ] (7.4) 

where we used 9k = Ef^lJ^]. 



Second moments as in (7.4) can be written as sums over pairs of paths. For a given 
path a, we call xf the value on the node at step i on path a (0 < i < L, with Xq = x 
and _2 = 1) and £"■ the indicator function that path a is open from steps i to j: 

^ = Hxf < xf +1 < x? +2 < ■ ■ ■ < x°). (7.5) 
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a a 

We now have the following expression for the second moment: 



(7.6) 



(7.7) 



where, by symmetry, we chose one particular arbitrary fixed path which bears the index 0. 
Similarly, 

(7.8) 



W[Bl] = LlJ^E^p^E^lT* 



We write now £q l = £o fc£fc L-k^L-k L- The ^ vs ^ an d ^ as ^ terms are J^-measurable, hence 



E*[$ = £!EE*[M,* E [^*W E [^N &-*,£#-*,£ 



(7.9) 



We make the same decomposition on £q L in (7.7). Writing E x [-] = E^ [E[-|.Ffc]] and 
pushing out of the inner expectation the J^-measurable terms, one gets 



E*[9 2 ] = Ll^E?[% h faE[$ L _ k £ tL _ k \r h ] C L - k ,Ld-k,L 



(7.10) 



Using (7.4), 



E x [Var(e\F k )] =Llx 



(7.11) 



For a given path a, the central term (in parenthesis) in the last expression is a kind of 
covariance. Clearly, if the paths a and do not meet in the interval {k, . . . , L — k}, the 
variables ££ L _ k and Ck L-k are independent and the covariance is zero. Therefore, we 
can restrict the sum over a in (7.11) to the paths which cross at least once the path in 
the interval {k, . . . , L — k}. With this modified sum, we can now find an upper bound to 
(7.4). Dropping all the negative terms and undoing the decomposition of £q l into three 
parts, we get 

E*[Var(0|J" fe )] < LI^'^Kl^l], (7-12) 

a 

where the prime on the sum indicates that a runs only over all the paths that meet 
path at least once in {k, . . . , L — k}. 

We now bound (7.12). Let I p , q be the set of all the paths such that 

• the p + 1 first nodes (including the origin) are the same as for path (in other 
words, the first p steps are the same as in path 0), 

• the next L — p — q — 1 nodes are different from those of path 0, 

• the next q + 1 nodes (thus including the end point) are the same as for path 0. 
By construction, for p < k and q < k, a path in I Ptq do not meet path in {k, . . . , L — k}. 
Therefore 

fc-lfc-l 



E' 



x [v*{e\rk)] < Li£F [tib&j. 



P=0 g=0 a£l p>q 



(7.13) 
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Notice that the first sum is not primed; it runs over all the L\ possible paths a. The 



inequality holds because in ( 7.12 ) we were summing over all the paths except all of those 



not crossing path in {k, L — k}, while in (7.13) we sum over all the paths except some 
of those not crossing path in {k, L — k}. 



The first term in (7.13) is simply E z [# 2 ], see (7.7). From [2] and (2.5) it has the 
following large L limit 

1 



lim AtE£[6> 2 

L->oo L 2 



lim ^LI^ET £ Q f 



4e 



-2X 



(7.14) 



We now focus on the second term. For a path a in I Pt q, a direct calculation shows that 

(1 - x fL- P -q-2 / 2L _ 2p -2q-2\ 
Z (2L-p-q-2)\[ L-p-q-1 l 



E a 






(7.15) 



Indeed, excluding the starting and ending points, there are 2(L — 1) —p — q total different 
nodes in the paths a and 0. All these nodes must be larger than x, hence the (1 — 
x )2L-p-g-2 t erm This is however not sufficient because the values on the nodes must be 
correctly ordered. Out of the (2L — p — q — 2)! possible orderings (see the denominator), 
the only good ones are those such that the p smallest terms be well ordered in the first 
shared segment (only one choice), the q largest terms be well ordered in the second 
shared segment (only one choice), and the 2L — 2p — 2q — 2 remaining terms be separated 
into two well ordered blocks oi L — p — q — 1 terms, one for each path; the only freedom is 
to choose which terms go to path a and which to path 0, hence the binomial coefficient. 
The number of terms in I Ptq is B(L—p — q), where B{n) is the number of permutations 
of n elements such that for any m in {1, . . . , n — 1} the image of {1, ... , m} through the 
permutation is not {l,...,m} (see [D] ; B(l) = 1, B{2) = 1, B(3) = 3, B{A) = 13, 
B(5) = 71, . . . ). Hegarty Martinsson [2] call this T(n, 1) and show (Proposition 2.6) 
that B{n) ~ n\. Then 



LI E EX koVo° 



ael„ 



LI 



(2L - 2p - 2q - 2)1 B{L - p 

X : ~~ X 



x (1 — x) 



2L-p-q-2 



(L-p-q-l)l (2L-p-q-2)\ (L - p - q - 1)\ 

Take x = X/L and L large with p and q fixed. The terms on the right hand side are 
respectively equivalent to L p+q+1 , (2L)~ p ~ q , L and e~ 2X , so that 



1 



km - 2 L\ £ Ei ^do, L 



L— kx) L 2 



a£l P ,q 



-2X 



2'P+I ' 



and 



k-lk-l 



p=o <j=o «e/p, 9 



o 

0,L 



-2X 



4(1 _ 2 - fe + 1 + 4- fc ) 



Using (7.14) and (7.17) in (7.13), we finally get 



limsup-^E$[Var(#|J- fe )] < ^ 



~2X 



(7.16) 



(7.17) 



(7.18) 



IS 
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from which one gets (7.3) and (7.2) 



7.2. Second step: separating the start and the end of the hypercube. We go 

x a ) L - 2k - l l{x a + y T <l), (7.19) 



back to the expression Ok given in (7.1): 



°k=J2 E n a m T t(a ^ t)(L - 2k)(l - y T 

\a\=k \r\=L~k 



and we introduce the following slightly different quantity 

® k = E E n a m T L(l -y T - Xcr + x a y T 

\a\=k \r\=L-k 



,L-2fc-l 



(7.20) 



(Compared to Ok, this one has no l(cr f* r), no t(x a + y T < 1), a factor L instead of 
L — 2k and an extra x a y T in the power.) Clearly, Ok < 9^. Furthermore, we know that 
W[0 k ] = W[9) = L(l - x) 1 " 1 so that 



x 

lim Ei 



®k_ 

T 



-X 



Let us compute the same expectation for Ok- Using 

E x {n a \x a ) = k(x a - x) k - l t{x a > x), E x (m T \y T ) = k{y T ) k -\ 
one gets 

rl 
/0 

Ll{L-2k-l)l ' 



E d 



/ dx a / dy T k(x a - x) k 1 k(y T ) k x (l - y T - x a + x a y T ) 
Jx Jo 



(L - k)l(L - k - 1)1 



1-x 



,L-k-l 



so that 



x_ 
lim Ei 






(7.21) 



(7.22) 



L-2fc-l 



(7.23) 



(7.24) 



Finally, Ok/L — Ok/L is a non-negative random variable with an expectation going to 
zero; it thus converges to zero in probability. Therefore, in the L — > oo limit by Slutsky's 
theorem, Ok/L and Ok/L have the same distribution as soon as one of the limits exists. 
It now simply remains to notice that 



L 



EMl-^) 1 " 2 *" 1 E rn T (l-y n 

\a\=k ' ^\r\=L-k 



\L-2k-l 



(7.25) 



which means that Ok/L can be written has a contribution coming from the k first steps 
of the hypercube times an independent contribution coming from the k last steps. The 
contribution from the start depends on the value x of the origin. By symmetry, the 
contribution from the end has the same law as the contribution from the start with x = 0. 
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7.3. Third step: the start of the hypercube is like a tree. We now focus on the 



first term in (7.25) 



<j> k = J2 n a (l-x a ) L - 2k -\ (7.26) 

\a\=k 

The goal is to show that for a starting point x = X/L, in the large L limit then in the 
large k limit, this (f)^ converges weakly to e~ x times an exponential distribution. Our 
strategy is to compare <f) k (defined on the first k levels of the hypercube) to the 0^/L of 
the tree by showing that in the L — > oo limit the two quantities have the same generating 
function. 

The difficulty, of course, is that one cannot write directly a recursion on the generating 
function of 0& as we did on the tree because the paths after the first step are not 
independent. To overcome this, we introduce another quantity 4>k(b) which is (in a 
sense) nearly equal to 0^: 

Mb)= Y, n a {b){l-x a ) L , (7.27) 

M=fc 

where we will shortly explain the meaning of the parameter b and give the definition of 
n ff (b). For now, let us just say that h a {b) < n a \ in other words, we discard some open 
paths when computing <f>k(b). It is clear that 

Mb) < ^ (7.28) 

and we will choose n a {b) in such a way that 

lim Et [4> k {b)} = lim Et [<f> k ] . (7.29) 



With the same argument as before, (7.28|) and (|7.29|) will be sufficient to conclude that 



if linii^oo 4>k(b) exists (we will show it is the case), then lim£_ s . 00 </>& exists as well and 
has the same distribution. Then, we will be able to write a recursion for the generating 
function of <f>k(b) and solve it in the L — > oo limit. 

Before going further, let us recall the following standard representation of the hyper- 
cube: to each node of the hypercube, we associate a different binary word with L bits 
(digits) in such a way that the starting point is (0, 0, ... , 0), the end point is (1, 1, . . . , 1) 
and making a step is changing a single zero into a one. A node a at level k has a label 
with exactly k ones. 

We can now define b and h a (b). The parameter b is a set of forbidden bits. Any path 
going through any bit in b is automatically discarded. In other words, n a {b) = if a has 
any bit equal to 1 which is in b. The parameter h a (b) is 1 or 0, depending on whether 
there is an "interesting" path or not to a. An interesting path is defined recursively in 
the following way: 

• From the origin, we consider which nodes amongst the L — \b\ reachable first level 
nodes have a value which is smaller than (lnL)/L; these are the "interesting" 
nodes at first level, and only the paths going through these interesting nodes are 
deemed interesting and are counted in n a . 

• Let b' be the bits corresponding to all the interesting nodes at first level. After 
the first step, these b' bits are now forbidden for all interesting paths. 
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• Given the forbidden bits, the region of the hypercube reachable from each inter- 
esting node at first level is a sub- hypercube of dimension L — \b\ — \b'\. All these 
hypercubes are non-overlapping. The construction of the interesting paths from 
each first level interesting node is now done recursively in the same way on each 
corresponding sub-hypercube. 

Notice that by construction h a (b) = if x a > (In L)/L. This is a small price to pay 
as we expect that only the x a of order 1/L contribute. Furthermore, at each step we 
exclude O(lnL) bits. For each open paths, at step k, there will therefore be kO(lnL) 
forbidden bits. This is very small compared to L and will become negligible in the large 
L limit. 

The definition of n a (b) leads directly to a recursion on 4>k(b): 

0fc(6, starting point = x) = Y^ t(x < Xp)0^_ 1 (6 U b' , starting point = x p ), (7.30) 

pdb> 

where b' is the (random) set of interesting first level nodes, those with a value smaller 
than (In L)/L which avoid the b forbidden bits. Given b' , for each bit p G b' , 4>^\ is an 



independent copy of the variable defined in (7.27) with a different starting point. The 
recursion is initialized by 

Mb) = (l-x) L , (7.31) 

which is non-random and independent of b. 

Before computing the expectation and the generating function, remark that the dis- 
tribution of 4>k(b) depends only on the number |6| of forbidden bits, not on the bits 
themselves. We will abuse this remark and consider from now on that in the expression 
^• x [<j>k(b)], the parameter b is actually the number of forbidden bits. 

Let us now compute the expectation of 4>k(b). The distribution of the number b' of 
interesting nodes is binomial and we call p(b') its law: 

P(6 H v hr) \ 1 --r) ■ (732) 



Then from (J7.30 ) 

"' Ldy 

9{0 ) X I 
b'=0 

We will show by recurrence that the dependence in b can be written as 



E*[&(&)] = £p(&') x b' f L ^ E^ fe _! (6 + &')]• (7.33) 



EX [Mb)] = {L !l L b _% L k Mx,L). (7.34) 



It is obvious from (7.31 ) that this works for k = 0. Assume that it works at level k — 1. 
Then 



«■[*.<»] =jhj:r(> \ L i;:tz> +1 y «L ' m**-^ L >- (T3b) 
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The sum on b' decouples from from the integral and can be computed; one finds 



L-b 



sr M\ (L-b-vy. ,, 



6'=0 



(L-b)l InL / InL 
(L-b-k)\~L~ \~~1T 



fe-i 



and one recovers (7.34) with 

ipk(x,L) ■■ 

or 

X 



InL 
~L~ 



k-l 



LdyVfc-i(y,L) 



1pk[-jr,L 



^frr^A^ 



(7.36) 



(7.37) 



(7.38) 



From here and ipo{x, L) = (1 — x) , it is straightforward to show by recurrence that 

MX/L,L)<e~ x . (7.39) 

Then, with this bound and the dominated convergence theorem, the limit of the integral 
in (7.38) is the integral of the limit and one shows by another straightforward recurrence 
that lim^oo ip k (X/L, L) = e~ x . 



Going back to (7.34), one then gets for any function b(L) such that b(L) = o(L) 



E * [Mb)] < e , lim E^ [&(&(£))] = e 

L—>oo 



-X 



(7.40) 



This completes the proof that 4>k(b) and 4>k have the same distribution in the L — > oo 
limit if that limit exists. 

We now compute the distribution of <j>k(b) by writing a generating function. For /U > 0, 
let 

G k (jJL,x,L,b) = E x [exp (- /^ fc (6))]. (7.41) 



(Here again, we consider that the parameter b of G/% is a number.) From (7.30), 



G k (fi,x,L,b) = J2p(b') 
&'=o 

L-b 



L 

mT 



InL \ 

+ / " dyG k -i(/j,,y,L,b + b') 



■\V 



So 



X 



G k U, T ,L,b) = Y,P& 



L 



b'=0 



L-b 



L 
InL 



InL 
L 



dy 1 -G fe -i( / u,y,L,5 + 6 / ) 



b'=0 



1 



InL 



InL 



dy 



X 



I - -G,.._ [fi,j,L,b + b' 



(7.42) 



(7.43) 



If the Gfc_i(. • •) on the right hand side did not depend on b' , one could compute exactly 
the sum on b' . We will write bounds on G k -\ using quantities that do not depend on b' 
and compute this sum. 

To do this, remark that G k is an increasing function of b. Indeed, as we forbid more 
bits {b increases), we close more open paths, 4>k{b) decreases (or remains constant) and, 
from (7.41), G k increases. 
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Therefore, a lower bound is easy: Gfc_i(/x, Y/L, L,b + b') > Gk-i(fJ-, Y/L, L, b) and 

L-b 



G k [ p,—,L,b) > 



1-i 
Ljx 



InL 



dY 



I - G k -i[ n,—,L,b 



(7.44) 



To obtain an upper bound, we use the fact that according to p, the probability that 
b' is larger than In L is very small. Then, in (7.43), we cut the sum over b' into two 
contributions. In the first part b' runs from to [hi L\ and in the second part it 
runs from [In 2 L\ + 1 to L — b. In the first part, we write Gj--i(^,Y/L,L,b + b') < 
Gfc_i(/i, Y/L, L,b + [In L\ ) and extend again the sum to L — b. In the second part we 
write that the term multiplying p(b') is smaller than 1. Hence 

L-b 



X 



Gk[p,—,L,b) < 



1 



InL 



dY 



Y 



I -G k - 1 (n,j,L,b+ Lin 2 L\ 



(7.45) 



1 
LJx 

L-b 

+ £ P(b'). 

b' = [hi 2 LJ+l 

The remaining sum is of course the probability that b' is larger than In L, which is 
vanishingly small as b' is binomial of average and of variance smaller than In L. 

We can now show that Gk(p>,X/L,L,b) has a large L limit by recurrence. More 
precisely, we will show that for any function b{L) which is a o(L), 



~ / X 

G k {n,X)\= lim G k (p,,—,L,b(L) 

L— >oo \ L 

exists and is independent of b(L). 

This is obvious for k = as Go([i, x, L, b) = exp[— p,(l — x) L ], so that 

G (/j,, X) = exp [ - fie~ x ] . 



(7.46) 



(7.47) 



Suppose that (7.46) holds up to level k — 1. Then for any function b(L) = o(L), 
the function b(L) + [hr L\ is also an o(L). We know from (7.41) and (7.40) that 
Gk(/J.,X/L,L,b) > 1 — jMi^ [4>k{b)\ > 1 — ^e~ x , so that we can use the dominated 
convergence theorem and obtain 



/•InL 

lim / dY 

L-s-oo Jx 



1 



G k -i[ n,—,L,o{L) 



x 



dr[i-G fe _iGu,y)]. 



(7.48) 



It is then straightforward from (7.44) and (7.45) to see that (7.46) holds at level k and 
that 



G k (n,X) =exp 



x 



dY [1 - G k -!(n,Y)] 



(7.49) 



Equations (7.47) and (7.49) are the same as (4.15), which completes the proof. 
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